Blog posts recommendation based on PLSA and Naive Bayesian classification algorithm

نویسندگان

  • Lin Cui
  • Caiyin Wang
  • Xiaoyin Wu
چکیده

As one of the important applications of Web2.0 technology, blog attracts more and more users. Writing and browsing blog has become a popular hotspot of network culture, which promotes the development of blog search service. But, the current blog search engines are mostly only based on matching query keywords; lack the ability of automatically extracting users’ interests and recommendation. Really Simple Syndication (RSS) is a format of describing website and keeping synchronization with website content. Using RSS to aggregate blog posts has the advantage of letting users get the latest update of blog posts. However, the posts collected by RSS don’t always attract users; users still need to browse every subscription post to find the interesting posts. To address this problem, the time spent by users on reading blog posts is viewed as a key factor to measure the users' interests. In this paper, we firstly used probabilistic latent semantic analysis (PLSA) to discovery the topics of blog posts, then adopted Naive Bayesian algorithm to classify the blog posts which was primarily connected with the users’ reading time, and lastly ranked and recommended the unread interesting posts to users. Experiments showed that our proposed method could recommend the favorite blog posts to users according to the users' browsing interests.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Validation Test Naive Bayesian Classification Algorithm and Probit Regression as Prediction Models for Managerial Overconfidence in Iran's Capital Market

Corporate directors are influenced by overconfidence, which is one of the personality traits of individuals; it may take irrational decisions that will have a significant impact on the company's performance in the long run. The purpose of this paper is to validate and compare the Naive Bayesian Classification algorithm and probit regression in the prediction of Management's overconfident at pre...

متن کامل

Micro-blog Personalized Query Expansion Based on Latent Topic Classification

With the increasing maturity of Web2.0 technology and development of micro-blog, the number of micro-blog pages is exponentially rising. Only relying on the traditional micro-blog search engine has not met the requirements of users. Aiming at that the retrieval efficiency of the traditional micro-blog searching method cannot meet the requirements of users, inspired by probabilistic latent seman...

متن کامل

‎A Bayesian mixture model‎ for classification of certain and uncertain data

‎There are different types of classification methods for classifying the certain data‎. ‎All the time the value of the variables is not certain and they may belong to the interval that is called uncertain data‎. ‎In recent years‎, ‎by assuming the distribution of the uncertain data is normal‎, ‎there are several estimation for the mean and variance of this distribution‎. ‎In this paper‎, ‎we co...

متن کامل

Uncertainty Modeling of a Group Tourism Recommendation System Based on Pearson Similarity Criteria, Bayesian Network and Self-Organizing Map Clustering Algorithm

Group tourism is one of the most important tasks in tourist recommender systems. These systems, despite of the potential contradictions among the group's tastes, seek to provide joint suggestions to all members of the group, and propose recommendations that would allow the satisfaction of a group of users rather than individual user satisfaction. Another issue that has received less attention i...

متن کامل

Consumer attitudes toward blogger's sponsored recommendations and purchase intention: The effect of sponsorship type, product type, and brand awareness

Sponsored recommendation blog posts, a form of online consumer review, are blog articles written by bloggers who receive benefits from sponsoring marketers to review and promote products on their personal blog. Because national regulations require that marketer sponsorship must be revealed in the blog post, sponsored recommendation posts can no longer conceal their marketing intent. Consumer’s ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014